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Bacteroides neonati strain MS4\ is the type strain of Bacteroides neonati sp. nov., a new spe- 
cies within the genus Bacteroides. This strain, whose genome is described here, was isolated 
from a premature neonate stool sample. B. neonati strain MS4^ is an obligate anaerobic 
Gram-negative bacillus. Here we describe the features of this organism, together with the 
complete genome sequence and annotation. The 5.03 Mbp long genome exhibits a Gh-C 
content of 43.53% and contains 4,415 protein-coding and 91 RNA genes, including 9 rRNA 
genes. 



Introduction 

Bacteroides neonati strain MS4t (= CSUR P 1500= 
DSM 26805), is the type strain of Bacteroides 
neonati sp. nov., and a new member of the genus 
Bacteroides. This bacterium is a Gram-negative, 
anaerobic, non spore-forming, indole positive ba- 
cillus that was isolated from a preterm neonate 
stool sample, during a study prospecting stool 
samples from patients with necrotizing 
enterocolitis and controls [unpublished]. 

To define a new bacterial species or genus, the 
"gold standard" method is the DNA-DNA hybridi- 
zation and G+C content determination [1]. How- 
ever, those methods are expensive, and poorly re- 
producible. The development of PGR and sequenc- 
ing methods led to new ways of classifying bacte- 
rial species, using in particular 16S rDNA se- 
quences with an internationally-validated cutoff 
value [2]. More recently, new bacterial genera and 
species are described using high throughput ge- 
nome sequencing and mass spectrometric anal- 
yses, which allow access to a wealth of genetic and 
proteomic information [3,4]. We propose the de- 
scription of a new bacterial species, using genome 
sequences, MALDI-TOF spectra, and the main 
phenotypic characteristics, as previously done [5- 
22]. 

Here we present a summary classification and a 
set of features for B. neonati sp. nov. strain MS4t 



(= GSUR P 1500= DSM 26805) together with a de- 
scription of the complete genomic sequencing and 
annotation. These characteristics support the cir- 
cumscription of a novel species, B. neonati sp. nov., 
within the Bacteroides genus. 

The Bacteroidaceae family is currently comprised 
of 3 genera: Acetomicrohium, Anaerorhabdus and 
Bacteroides. It is a heterogeneous family, grouping 
anaerobic and morphologically variable bacteria, 
and it is defined mainly on the basis of phyloge- 
netic analyses of 16S rDNA sequences. The most 
closely related species to Bacteroides neonati sp. 
nov. is Bacteroides graminisolvens [23] followed 
by Bacteroides intestinalis [24]. Bacteroides 
neonati is a strictly anaerobic Gram negative, non 
spore-forming bacterium. 

Classification and features 

A stool sample was collected from a patient during 
a case-control study analyzing the fecal microbiota 
of premature neonates with necrotizing 
enterocolitis, using MALDI-TOF and 16S rRNA 
gene sequencing [unpublished]. After collection in 
Marseille, the specimen was preserved at -80°C. 
Strain MS4t [Table 1) was isolated in October 
2012, by anaerobic cultivation on 5% sheep 
blood-enriched Columbia agar (BioMerieux, Marcy 
I'Etoile, France). This strain exhibited a 94% nu- 
cleotide sequence similarity with Bacteroides 
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graminisolvens [23] and a 94% nucleotide se- 
quence similarity with Bacteroides intestinalis 
[24]. Those similarity values are lower than the 
threshold recommended to delineate a new spe- 



cies without carrying out DNA-DNA hybridization 
[38]. In the inferred phylogenetic tree, it forms a 
distinct lineage close to Bacteroides 
graminisolvens [Figure 1). 



Table 1. Classification and general features of Bacteroides neonati strain MS4^ according to the MIGS 
recommendations [25] 



MIGS ID 


Property 


Term 


Evidence code" 






Domain Bacteria 


TAS [26] 






Phylum Bacteroidetes 


TAS [27,28] 






Class Bacteroidia 


TAS [27,29] 




Current classification 


Order Bacteroidales 


TAS [27,30] 






Family Bacteroidaceae 


TAS [31,32] 






Genus Bacteroides 


TAS [31,33-36] 






Species Bacteroides neonati 


IDA 






Type strain MS4 


IDA 




Gram stain 


Negative 


IDA 




Cell shape 


Bacillus 


IDA 




Motility 


Non motile 


IDA 




Sporulation 


Non spore-forming 


IDA 




Temperature range 


Mesophile 


IDA 




Optimum temperature 


37°C 


IDA 


MIGS-6.3 


Salinity 


Weak growth on BHI medium + 1% NaCI 


IDA 


MIGS-22 


Oxygen requirement 


Anaerobic 


IDA 




Carbon source 


Unknown 


NAS 




Energy source 


Unknown 


NAS 


MIGS-6 


Habitat 


Gut 


IDA 


MIGS-15 


Biotic relationship 


Free living 


IDA 




Pathogenicity 


Unknown 






Biosafety level 


2 




MIGS-14 


Isolation 


Stool sample 


NAS 


MIGS-5 


Sample collection time 


October 2012 


IDA 


MIGS-4.1 


Latitude 


43.296482 


IDA 


MIGS-4.1 


Longitude 


5.36978 


IDA 


MIGS-4.3 


Depth 


Surface 


IDA 


MIGS-4.4 


Altitude 


0 above sea level 


IDA 



""Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report 
exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, 
isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). The- 
se evidence codes are from the Gene Ontology project [37]. If the evidence is IDA, then the property was 
directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgments. 
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-SsctefOKfes nordv WJ.1Q50 Q4R043017) 



Figure 1. Phylogenetic tree highlighting the position of Bacteroides neonati MS4^ relative to other type 
strains within the genus Bacteroides. GenBank accession numbers are indicated in parentheses. Se- 
quences were aligned using CLUSTALW, and phylogenetic inferences obtained using the maximum- 
likelihood method within the MEGA 4 software [39]. Numbers at the nodes are bootstrap values ob- 
tained by repeating the analysis 500 times the analysis to generate a majority consensus tree. The 
scale bar represents a 2% nucleotide sequence divergence. 



Seven different growth temperatures [23°C, 25°C, 
28°C, 32°C, 35°C, 37°C, 50°C) were tested; no 
growth occurred at 50°C, growth occurred be- 
tween 23° and 37°C, and optimal growth was ob- 
served at 37°C. 

Colonies are punctiform, medium-sized, grey, 
shiny and round on blood-enriched Columbia agar 
under anaerobic conditions using GENbag anaer 
[BioMerieux). Bacteria were grown on blood- 
enriched Columbia agar [Biomerieux) and in 
Trypticase-soy TS broth medium, under anaerobic 
conditions using GENbag anaer [BioMerieux). 
They also were grown under anaerobic conditions 



on BHI agar and on BHI agar supplemented with 
1% NaCl. Growth was achieved only anaerobically 
on blood-enriched Columbia agar and weakly on 
BHI agar as well as BHI agar supplemented with 
1% NaCl after 72h incubation. Gram staining 
showed plump non spore-forming Gram-negative 
bacilli (Figure 2). The motility test was negative. 
Cells grow anaerobically in TS broth medium have 
a mean wide of 0.681 |im [min = 0.323 ^m; max = 
0.878 [im] and a mean length of 2.165 |im [min = 
1.402; max = 2.951), as determined using electron 
microscopic observation after negative staining 
[Figure 3). 
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Figure 3. Transmission electron micrograph of B. 
neonati strain MS4\ using a Morgani 268D (Philips) at 
an operating voltage of 60kV. The scale bar represents 
1,000 nm. 



Strain MS4t exhibited catalase activity but no oxi- 
dase activities. Using API 2 OA, a positive reaction 
could be observed only weekly for Gelatinase. Us- 
ing Api Zym, a positive reaction was observed for 
alkaline phosphatase [40 nmol of hydrolyzed sub- 
strata), acid phosphatase [40 nmol), 
naphtolphosphohydrolase [20 nmol), esterase [20 
nmol), esterase lipase [5 nmol), alpha- 
galactosidase [5 nmol), beta-galactosidase [20 
nmol), beta-glucuronidase [30 nmol), beta- 
glucosidase [5 nmol), N-acetyl-beta- 



glucosaminidase [40 nmol) and alpha-fucosidase 
[5 nmol). Using Api rapid id 32A, a positive reac- 
tion was observed for alpha-galactosidase, alpha- 
glucosidase, N-acetyl-beta-glucosaminidase and 
alpha-fucosidase. Regarding antibiotic susceptibil- 
ity, Bacteroides neonati was susceptible to 
clavulanate-amoxicillin, imipenem and metroni- 
dazole. When compared to the representative spe- 
cies within the genus Bacteroides, B. neonati ex- 
hibits the phenotypic characteristics detailed in 
Table 2 [40]. 



Table2. Differential characteristics of Bacteroides neonati sp. nov., strain MS4^, B. graminisolvens 
strain DSM 19988\ and B. intestinalis strain DSM 17397^. 



Properties 


B. neonati 


B. graminisolvens 


B. intestinalis 


Cell wide (|jm) 


0.3-0.8 


0.4-0.6 


n.a. 


Cell long (|jm) 


1.4-2.9 


1 .2-4.5 


1-3 


Oxygen requirement 


Anaerobic 


Anaerobic 


Anaerobic 


Gram stain 


Negative 


Negative 


Negative 


Optimal growth temperature 


37°C 


35°C 


37°C 


Habitat 


Human 


Methanogenic reactor 


Human 


Enzyme production 








Indole 






+ 


Alkaline Phosphatase 


+ 




+ 


Urease 








Catalase 


+ 




n.a. 


Gelatinase 


+ 






Utilization of 








Glucose 




+ 


+ 


Mannose 




+ 


+ 


Lactose 




+ 


+ 


Raff i nose 




+ 


+ 



http://standardsingenomics.org 



797 



Bacteroides neonati MS4T 



Matrix-assisted laser-desorption/ionization time- 
of-flight [MALDI-TOF) MS protein analysis was 
carried out as previously described [41]. A pipette 
tip was used to pick one isolated bacterial colony 
from a culture agar plate, and to spread it as a thin 
film on a MTP 384 MALDI-TOF target plate 
(Bruker Daltonics, Germany). Ten distinct depos- 
its were done for strain MS4''' from ten isolated 
colonies. Each smear was overlaid with 2 of 
matrix solution (saturated solution of alpha- 
cyano-4-hydroxycinnamic acid) in 50% acetoni- 
trile, 2.5% tri-fluoracetic acid, and allowed to dry 
for five minutes. Measurements were performed 
with a Microflex spectrometer (Bruker). Spectra 
were recorded in the positive linear mode for the 
mass range of 2,000 to 20,000 Da (parameter set- 
tings: ion source 1 (ISI), 20kV; IS2, 18.5 kV; lens, 7 
kV). A spectrum was obtained after 675 shots at a 
variable laser power. The time of acquisition was 
between 30 seconds and 1 minute per spot. The 
ten MS4T spectra were imported into the MALDI 
Bio Typer software (version 2.0, Bruker) and ana- 
lyzed by standard pattern matching (with default 



parameter settings) against the main spectra of 
6,335 bacteria, in the Bio Typer database. The 
method of identification includes the m/z from 
3,000 to 15,000 Da. For every spectrum, 100 
peaks at most were taken into account and com- 
pared with the spectra in database. A score ena- 
bled the identification, or not, from the tested spe- 
cies: a score > 2 with a validated species enabled 
the identification at the species level; a score > 1.7 
but < 2 enabled the identification at the genus lev- 
el; and a score < 1.7 did not enable any identifica- 
tion. For strain MS4''', the best-obtained score was 
1.345, which is not significant, suggesting that our 
isolate was not a member of a known genus. The 
reference spectrum from strain MS4t (Figure 4) 
was added to our database. A dendrogram was 
constructed with the MALDI Bio Typer software 
(version 2.0, Bruker), comparing the reference 
spectrum of strain MS4 with reference spectra of 
26 bacterial species, all belonging to the order of 
Bacteroidetes. In this dendrogram, strain MS4t ap- 
pears as a separated branch within the genus 
Bacteroides (Figure 5). 




Figure 4. Reference mass spectrum from B. neonati strain MS4^. Spec- 
tra from 10 individual colonies were compared and a reference spec- 
trum was generated. 
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MSP Dendrogram 




BacterDides ureolyticus 9118677 4 
Bacteroides yreolyticus HU33416"PNU 
Bacteroides yreolyticus DSM 20703T DSM 
Bacteroides ureolyticus 9405371 
Bacteroides ureolyticus 9218695 
Bacteroides ureolyticus 9128593 
Bacteroides ureolyticus 9126693 
Bacteroides ureolyticus 8274759 3 
Bacteroides coagulans DSM 207D5T DSM 
Bacteroides wigatus PNU 71838 PNU 
Bacteroides wilgatus PNU 1536 5 PNU 
Bacteroides wilgatus HU40347 "2 PNU 
Bacteroides wigatus DSM 3289 DSM 
Bacteroides wigatus DSM 1447T DSM 
Bacteroides massiliensis DSM 17679TDSM 
Bacteroides pyogenes 9140244 
Bacteroides neparinolyticus 9302789 2 
Bacteroides heparinolyticus 9208882 
Bacteroides cellulosilyticus 10069242 
Bacteroides intestinaSs DSM 17393TDSM 
Bacteroides intestinalis SW20 PNU 
Bacteroides intestinalis 110706 E9 LUMC 
Bacteroides timonensis 
Bacteroides -.: _:„: 
Bacteroides cmlus [ 
Bacteroides o\fltus 53152 PNU 
Bacteroides o\fltus IBS MS 21 IBS 
Bacteroides OMtus HU3B898" 1 PNU 
Bacteroides OMtus 32456 1 PNU 
Bacteroides o\fltus 11483~2 PNU 
Bacteroides flnegoldii DSH 17565TDSM 
Bacteroides thetaiotaomicron 9075720 
Bacteroides thetaiotaomicron 9075717 
Bacteroides thetaiotaomicron DSM 2079T DSM 
Bacteroides thetaiotaomicron ATCC 29742 PNU 
Bacteroides thetaiotaomicron HU68218 5 PNU 
Bacteroides thetaiotaomicron CCM 6028 COM 
Bacteroides thetaiotaomicron CCM 6027 CCM 
Bacteroides thetaiotaomicron CCM 4713 CCM 
Bacteroides thetaiotaomicron DSM 2255 DSM 
Bacteroides thetaiotaomicron 71280 2 PNU 
Bacteroides thetaiotaomicron 53164TNU 
Bacteroides caccae DSM 19024T DSM 
Bacteroides caccae 103 PIM 
Bacteroides uniformis DSM 6597T DSM 
Bacteroides uniformis ATCC 8492TTHL 
Bacteroides uniformis HU33120 3 13 PNU 
Bacteroides uniformis 110706_F9tUMC 
Bacteroides neonatalis 
Bacteroides gallinarum DSM 18171T DSM 
Bacteroides eggerthii DSM 20697T DSM 
Bacteroides nordii 491 512 PNU 
Bacteroides salyersiae DSM 18765T DSM 
Bacteroides nordii DSM 18764TDSM 
Bacteroides salyersiae IBS MS 27 IBS 
Bacteroides salyersiae 110706 F7 LUMC 
Bacteroides salyersiae 110706"A7 LUMC 
Bacteroides nordii 22710 0377~Ana IBS 
Bacteroides stercoris DSU 19555TDSM 
Bacteroides eggerthii 
Bacteroides pyogenes DSM 19673 DSM 
Bacteroides pyogenes DSM 20612 DSM 
Bacteroides pyogenes DSM 2061 IT DSM 
Bacteroides lyogenes 995000355 LBK 
Bacteroides fragilis PNU R19811 PNU 
Bacteroides fraSilis PNU BF 6712 PNU 
Bacteroides fragilis DSM 9669 DSM 
Bacteroides fragilis DSM 2151T DSM 
Bacteroides fraSilis MB 9009 05 THL 
Bacteroides fragilis MB 5088 05 THL 
Bacteroides fraSilis HU26938-2 PNU 
Bacteroides fragilis CCM ASm CCM 
Bacteroides fraSilis CCM 4508 CCM 



) 500 
Distance Le\el 



Figure 5. Dendrogram based on the comparison of the B. 
trum, and 72 other species of the genus of Bacteroides. 

Genome sequencing and annotation 

Genome project history 

The organism was selected for sequencing be- 
cause it was isolated from a premature neonate 
stool sample as part of a study prospecting stool 

Table 2. Project information 



neonati strain MS4^ MALDI-TOF reference spec- 
samples from patients with necrotizing 
enterocolitis. 

The Genbank accession number is HG726019 - 
HG726036 and consists of 18 scaffolds with a total 
of 35 contigs. Table 3 shows the project infor- 
mation and its compliance with MIGS version 2.0 
standards. 



MIGS ID 


Property 


Term 


MIGS-31 


Finishing quality 


Non-contiguous finished 


MIGS-28 


Libraries used 


One 454 PE 3-kb library 


MIGS-29 


Sequencing platforms 


454 GS FLXh- Titanium 


MIGS-31. 2 


Sequencing coverage 


27.0 


MIGS-30 


Assemblers 


Newbler 2.8 


MIGS-32 


Gene calling method 


Prodigal 2.5 




Genbank ID 


HG726019 - HG726036 




Genbank Date of Release 


November, 201 3 


MIGS-13 


Source material identifier 


DSM 26805 




Project relevance 


Stool samples from patients with necrotizing enterocolitis 



http://standardsingenomics.org 



799 



Bacteroides neonati MS4T 



Growth conditions and DNA isolation 

Bacteroides neonati strain MS4t [= CSUR P 1500= 
DSM 26805], was grown on blood agar medium at 
37°C under anaerobic conditions. Eight petri dish- 
es were spread and resuspended in 5 xlOO|xl of G2 
buffer. A first mechanical lysis was performed us- 
ing glass powder in the Fastprep-24 Sample Prep- 
aration system [MP Biomedicals, USA) with 2x20 
second bursts. DNA was then incubated with lyso- 
zyme (30 minutes at 37°C] and extracted on a 
Bio Robot EZ 1 Advanced XL (Qiagen]. The DNA 
was then concentrated and purified on a Qiamp kit 
[Qiagen). The yield and the concentration were 
measured by the Quant-it Picogreen kit [Invitro- 
gen) on the Genios_Tecan fluorometer at 
15.7ng/nl. 

Genome sequencing and assembly 

A 3 kb paired end library was pyrosequenced on 
the 454 Roche Titanium. This project was loaded 
on a 1/4 region on PTP Picotiterplates. 5 |ig of 
DNA was mechanically fragmented with a 
Hydroshear device [Digilab, Holliston, MA, USA) 
with an enrichment size at 3-4kb. The DNA frag- 
mentation was visualized with an Agilent 2100 
Bio Analyzer on a DNA labchip 7500 with an aver- 
age size of 3.2 kb. The library was constructed ac- 
cording to the 454 Titanium paired end protocol 
supplied by the manufacturer. Circularization and 
nebulization were performed and generated a pat- 
tern with an optimal at 604 bp. After PGR amplifi- 
cation through 15 cycles followed by double size 
selection, the single stranded paired end library 
was then quantified on the Agilent 2100 
BioAnalyzer on a RNA pico 6,000 labchip at 
91pg/|iL. The library concentration equivalence 
was calculated at 2.76 x 10^ molecules/nL. The li- 
brary was stored at -20°G until used. 

The library was clonally amplified with 0.5 and 1 
cpb in 2 emPCR reactions in each condition with 
the GS Titanium SV emPCR Kit (Lib-L) v2. The 



yield of the emPCR was 10.46 and 11.53%, respec- 
tively, according to the quality expected by the 
range of 5 to 20% from the Roche procedure. 
790,000 beads were loaded on the GS Titanium 
PicoTiterPlates PTP Kit 70x75 sequenced with the 
GS Titanium Sequencing Kit XLR70 

The 454 sequencing generated 811,269 reads 
[180 Mb, coverage of 27.0) assembled into contigs 
and scaffolds using Newbler version 2.8 [Roche, 
454 Life Sciences) and Mira assembler v3.2 [42]. 
The obtained contigs were combined using the 
Opera software vl.2 [43] in tandem with GapFiller 
VI. 10 [44] to reduce the set. Finally, some manual 
refinements using CLC Genomics software v4.7.2 
[CLC bio, Aarhus, Denmark) were made. The ge- 
nome consists of 35 contigs inl8 scaffolds. 

Genome annotation 

Non-coding genes and miscellaneous features 
were predicted using RNAmmer [45], ARAGORN 
[46], Rfam [47], PFAM [48]. Open Reading Frames 
[ORFs) were predicted using Prodigal [49] with 
default parameters but the predicted ORFs were 
excluded if they spanned a sequencing GAP region. 
The functional annotation was achieved using 
BLASTP [50] against the GenBank database [51] 
and the Clusters of Orthologous Groups [COGs) 
database [52,53]. 

Genome properties 

The genome of B. neonati strain MS4t is estimated 
to be 5.03 Mb long with a G+C content of 43.53% 
[Figure 6 and Table 4). A total of 4,415 protein- 
coding and 91 RNA genes, including 9 rRNA genes, 
65 tRNA, 1 tmRNA and 39 miscellaneous other 
RNA were founded. The majority of the protein- 
coding genes were assigned a putative function 
[69.26%) while the remaining ones were annotat- 
ed as hypothetical proteins. The properties and 
the statistics of the genome are summarized in 
Table 4. The distribution of genes into COG func- 
tional categories is presented in Table 5. 
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I GC skew+ 
I GC skew- 



I rRNA 
tmRNA 



I misc RNA 
I tRNA 



Functional classes of genes according to COGs 
Cellular processes and signaling 

■ Cell cycle control, cell divisron. chromosome partitioning 
I C*ll w si l/mtrrtbrsne/^nw slope biiag«i^)^$i$ 

■ Cell motility 

Post-translational modification, protein turnover, and thaperones 
Signal transduction mechanisms 
B lntra'C«nular trafficdng. secrettDn. and vesi<:ular transport 
Defense mechaniBms 

Metabolism 

I Energy production and conversion 
H Amino acid transport and metabolism 
I NiJcl«otide transport and m eta bolls, mi 
I Carbohydrate trarsiport and mAtabolism 

Coenzyme transport and metabolism 
I Upi'^ rran^part ,aiid n^^^tabali^iri 

Inorganic ion transport and metabolism 

Secondary metabolites Isiosynthesis, transport, and catsbolism 



Informatjon storage and processing 


1 Translation, ribOSOmikl tttXiCCUr^ Snd biogenesis 




1 Transcription 




1 R^plicatiiari, recombinationi ^nd repair 




Poorly characterized 


1 Gene-ral function prediction only 




B Function unknown 





Figure 6. Circular representation of the Bacteroides neonati chromosome. Circles from the center to the outside: GC 
skew (green/purple), GC content (black), tRNA (dark red), rRNA (purple), tmRNA (blue), miscellaneous RNA (deep 
blue) on forward strand, genes on forward strand colored by COGs categories, scaffolds in alternative grays, genes on 
reverse strand colored by COGs, tRNA (dark red), rRNA (purple), tmRNA (blue), miscellaneous RNA (deep blue) on re- 
verse strand. 



Table 4. Nucleotide content and gene count levels of the genome 



Attribute 


Value 


% of total^ 


Genome size (bp) 


5,026,786 


100 


DNA coding region (bp) 


4,556,154 


90.63 


DNA G+C content (bp) 


2,188,298 


43.53 


Total genes 


4506 


100 


rRNA 


9 


0.18 


tRNA 


65 


0.1 


tmRNA 


1 


0.01 


miscRNA 


39 


0.05 


Protein-coding genes 


4415 


97.98 


Genes with function prediction 


3121 


69.26 


Genes assigned to COGs 


4303 


97.46 


"The total is based on either the size of the 


genome in base pairs 


or the total number 



of protein coding genes in the annotated genome 
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Table 5. Number of genes associated with the 25 general COG functional categories. 



Code 


Value 


% age^ 


Description 


J 


225 


4.73 


Translation 


A 


15 


0.31 


RNA processing and modification 


K 


280 


5.89 


Transcription 


L 


313 


6.58 


Replication, recombination and repair 


B 


10 


0.21 


Chromatin structure and dynamics 


D 


84 


1.77 


Cell cycle control, mitosis and meiosis 


Y 


3 


0.06 


Nuclear structure 


V 


90 


1.89 


Defense mechanisms 


T 


189 


3.97 


Signal transduction mechanisms 


M 


317 


6.66 


Cell wall/membrane biogenesis 


N 


35 


0.73 


Cell motility 


Z 


11 


0.23 


Cytoskeleton 


W 


0 


0 


Extracellular structures 


u 


135 


2.84 


Intracellular trafficking and secretion 


o 


187 


3.93 


Posttranslational modification, protein turnover, chaperones 


c 


271 


5.69 


Energy production and conversion 


G 


309 


6.49 


Carbohydrate transport and metabolism 


E 


300 


6.31 


Amino acid transport and metabolism 


F 


105 


2.21 


Nucleotide transport and metabolism 


H 


200 


4.2 


Coenzyme transport and metabolism 


1 


112 


2.35 


Lipid transport and metabolism 


P 


336 


7.06 


Inorganic ion transport and metabolism 


Q 


66 


1 .39 


r I J. 1 1 • J.I • J. J. 1 J. 1 I • 

Secondary metabolites biosynthesis, transport and catabolism 


R 


568 


11.93 


General function prediction only 


S 


486 


10.21 


Function unknown 




112 


2.35 


Not in COGs 



^The total is based on the total number of protein coding genes in the annotated genome. 

The draft genome sequence of Bacteroides neonati 
Insights into the genome sequence has a smaller size compared to the Bacteroides 

intestinalis [respectively, 5.03 Mb against 6.05 
We made some brief comparisons against Mb). The G+C content is very close to fiactero/des 

Bacteroides intestinalis DSM 17393 intestinalis [respectively, 43.53% and 42.8%). 

[ABJLOOOOOOGO) that is currently the closest Bacteroides neonati has slightly fewer genes 

available sequenced genome. This genome is com- (4,506 genes against 4,984 genes), and a higher 

posed of 8 contigs [ABJL02000001- ratio of genes per Mb [895.82 genes/Mb against 

ABJL02000008). 823.8 genes/Mb). 
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Table 6. Percentage of genes associated with the 25 general COG functional categories for B. neonati and B. 
intestinalis DSM 20548. 



Code 


B. neonati 
%aBe^ 


B. intestinalis 

%aee^ 


% Difference 


C^CyCi Hpsrrintinn 

Xw-V^VJ VlV.^«..l liyilvvll 


1 


4.73 


4.26 


0.47 


Tr<^nc;l;^f inn rihnc;nm.^l cifnirfiirp ;^nrl hiocrpnptjit; 


A 


0.31 


0.24 


0.07 


RNJA nrnrpt;t;in(7 ;^nrl morlifir;^tion 

l \ I > / \ U 1 \y\^\Zjj I 1 1 ti Cll 1 U II 1 i 1 l^Cl Ll\_/I 1 




5 8Q 


5 63 




Tr:^ n c:r~r intion 

1 IdllDL^I ILJLIWII 


L 


6.58 


6.21 


0.37 


Rpnlir.^finn rprnrnhin.^f ion .^nrl rpn;^ir 

IXvTlJ 1 1 *^Cl LI Wl 1 , 1 vIV^ W 1 1 lU 1 1 1 CI L 1 Wl 1 CI 1 1 ^-J 1 \I7L/0 1 1 




0 ?1 


0 ?1 


n 


IIIWIIIClLIII DLIULLUIC ClIILJ Uyilclllll*-.a 


n 


1 .77 


1 49 


W . Z. %J 


r^pll r~\/r~lp r~ontrol r~pll rli\/ic:ion pnromociomp n;^i'titionincj 

v^C II L^yL.IC LvJIILIvJI/ LCII UIVlDlvJII^ L.IIIvJIIICJDCJIIIC: LJCllLILIWIIIIIcl 


Y 


0 Oft 


0 0? 


0 04 


NJiir~lp;^i' Qtrt 1 r~tt 1 rp 

INUL^ICCll DLIULLUIC 


V 


1 .89 


2.46 


-0.57 


r)pfppcp mprhpini*^m*^ 

I _V V . J V . 1 J 1 1 I V . I- 1 JC4.I ll^l 1 1^ 


T 
1 


^ Q7 




-0 71 


Clonal tr:^ ncni ir^tion rnf^r^n:^ n icmc 


M 




7 46 


-0 8 


r^pll \A/:^ 1 l/mpmnr:^ np nioopnPQic: 

V_,v.7 1 1 VV ClII/llldllLJICllICr U 1 vjtid IC J 1 5 




0 73 


1.11 


-0 38 


r^pl 1 moti 1 it\/ 

V_,v., II lllvJLIIILy 


7 

i— 


0 ? 3 


0 1? 


Oil 


\/ to Q K P 1 pto n 


W 


0 


0.02 


-0.02 


Extracellular structures 


u 


2.84 


2.94 


-0.1 


1 nf r<^rp| 1 1 il,^ r tr;^ffirk'in(5 pnri tiprrption pnri vptiinilpr tr;^nc;nnrt 

IIILICtt^V^IIUIdl Lldlll^lXlllcl C1I1\J fi v.. 1 v^LI V / 1 1, CI 1 1 *J V J I LJ 1 CI 1 LIClll DLJ wl L 


o 


3.93 


3.45 


0.48 


Pnc;ffr;^nc;l;^f ionpl morlifirpfion nrofpin fiirnovpr rhpnpronpt; 

1 W jLLI CI 1 IjIClLI W 1 1 CI 1 III w \J II 1 d L 1 W 1 1 > yjl W Lv^ 111 L U 1 1 1 W V C 1 . 1 1 d d W 1 1 v; J 


c 


5.69 


4.58 


1.1 1 


Energy production and conversion 


G 


6.49 


8.77 


-2.28 


Carbohydrate transport and metabolism 


F 


6 31 


5 53 




Amino ;^r~in tr:^nQnort ;^nn mptpholiQm 


F 


2.21 


1 .97 


0.24 


Nucleotide transport and metabolism 


H 


4.2 


4 09 


n 1 1 


f"opn7\/mp tr';^ n Qnort :^nn mptpholiQm 

v_, V y V, 1 IZ.yiIIC LIdlljUvJIL dlHJ II ICLClL/W 1 1 Dl 1 1 


1 
1 






W. 1 <J 


1 iniri trancnorf" anri mf^t^^nol icm 

LIUIU Ll dl IJUvJI L dilU [ 1 IcLdUUl I d[ 1 1 


P 


7.06 


6.21 


0.85 


Inorganic ion transport and metabolism 


Q 


1.39 


1.37 


0.02 


Secondary metabolites biosynthesis, transport and catabolism 


R 


11.93 


12.62 


-0.69 


General function prediction only 


S 


10.21 


10 


0.21 


Function unknown 




2.35 


2.33 


0.02 


Not in COGs 



"The total is based on the total number of protein coding genes in the annotated genome. 
Table 6 presents the difference of gene number [in Conclusion 



percentage) related to each COG category between 
Bacteroides neonati and Bacteroides intestinalis. 
The proportion of COG is highly similar between 
the two species. The maximum difference is relat- 
ed to the COG "Carbohydrate Metabolism and 
transportation" which does not exceed 2.28%. 



On the basis of phenotypic, phylogenetic and ge- 
nomic analysis, we formally propose the creation 
of Bacteroides neonati that contains the strain 
MS4T. This bacterium has been found in Marseille, 
France. 
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Description of Bacteroides neonati sp, nov, 

Bacteroides neonati (neo.na'ti L. gen. masc. 
n. neonati, because this new species has been first 
isolated from a preterm neonate stool sample)is a 
Gram-negative bacillus; Obligate anaerobic; Non- 
spore-forming bacterium; Grows on axenic medi- 
um at 37°C in anaerobic atmosphere; Negative for 
indole; Non motile; The G+C content of the ge- 
nome is 43.53%. The type strain is MS4t [= CSUR 
P 1500 = DSM 26805). 
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